Aerial Monocular 3D Object Detection

نویسندگان

چکیده

Drones equipped with cameras can significantly enhance human's ability to perceive the world because of their remarkable maneuverability in 3D space. Ironically, object detection for drones has always been conducted 2D image space, which fundamentally limits understand scenes. Furthermore, existing methods developed autonomous driving cannot be directly applied due lack deformation modeling, is essential distant aerial perspective sensitive distortion and small objects. To fill gap, this work proposes a dual-view system named DVDET achieve monocular both space physical address severe view issue, we propose novel trainable geo-deformable transformation module that properly warp information from drone's birds' eye (BEV). Compared cars, our includes learnable deformable network explicitly revising deviation. dataset challenge, new large-scale simulation AM3D-Sim, real-world AM3D-Real high-quality annotations detection. Extensive experiments show i) feasible; ii) model pre-trained on helps performance; iii) DVDET also cars. encourage more researchers investigate area, released related code.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Monocular Object Detection Using 3D Geometric Primitives

Multiview object detection methods achieve robustness in adverse imaging conditions by exploiting projective consistency across views. In this paper, we present an algorithm that achieves performance comparable to multiview methods from a single camera by employing geometric primitives as proxies for the true 3D shape of objects, such as pedestrians or vehicles. Our key insight is that for a ca...

متن کامل

3d Object Reconstruction from Aerial Stereo Images

Among the variety of problems in providing 3D information about topographic objects, efficient data collection and model construction are issues for research. The efforts are directed towards improving the tedious, time and man-power consuming process of data generation by applying automation. In this paper, we present a semi-automatic method for acquiring 3D topologically structured data from ...

متن کامل

Monocular Vision-Based Underwater Object Detection

In this paper, we propose an underwater object detection method using monocular vision sensors. In addition to commonly used visual features such as color and intensity, we investigate the potential of underwater object detection using light transmission information. The global contrast of various features is used to initially identify the region of interest (ROI), which is then filtered by the...

متن کامل

Robust 3D Object Tracking from Monocular Images using Stable Parts.

We present an algorithm for estimating the pose of a rigid object in real-time under challenging conditions. Our method effectively handles poorly textured objects in cluttered, changing environments, even when their appearance is corrupted by large occlusions, and it relies on grayscale images to handle metallic environments on which depth cameras would fail. As a result, our method is suitabl...

متن کامل

Monocular Multiview Object Tracking with 3D Aspect Parts

In this work, we focus on the problem of tracking objects under significant viewpoint variations, which poses a big challenge to traditional object tracking methods. We propose a novel method to track an object and estimate its continuous pose and part locations under severe viewpoint change. In order to handle the change in topological appearance introduced by viewpoint transformations, we rep...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE robotics and automation letters

سال: 2023

ISSN: ['2377-3766']

DOI: https://doi.org/10.1109/lra.2023.3245421